Fast Metagenomic Binning via Hashing and Bayesian Clustering
نویسندگان
چکیده
منابع مشابه
Metagenomic binning through low density hashing
Bacterial microbiomes of incredible complexity are found throughout the world, from exotic marine locations to the soil in our yards to within our very guts. With recent advances in Next-Generation Sequencing (NGS) technologies, we have vastly greater quantities of microbial genome data, but the nature of environmental samples is such that DNA from different species are mixed together. Here, we...
متن کاملLow-Density Locality-Sensitive Hashing Boosts Metagenomic Binning.
Metagenomic binning is an essential task in analyzing metagenomic sequence datasets. To analyze structure or function of microbial communities from environmental samples, metagenomic sequence fragments are assigned to their taxonomic origins. Although sequence alignment algorithms, such as BWA, Bowtie or BLAST, can readily be used and usually provide high-resolution alignments and accurate binn...
متن کاملEfficient Clustering of Metagenomic Sequences using Locality Sensitive Hashing
The new generation of genomic technologies have allowed researchers to determine the collective DNA of organisms (e.g., microbes) co-existing as communities across the ecosystem (e.g., within the human host). There is a need for the computational approaches to analyze and annotate the large volumes of available sequence data from such microbial communities (metagenomes). In this paper, we devel...
متن کاملFast Segmentation via Randomized Hashing
This paper describes a feature based approach to segmenting images into coherent regions. The method draws inspiration from earlier work on randomized projection schemes for approximate nearest neighbor computation. The method proceeds by first computing a descriptor vector for each of the pixels in the image. These vectors are then randomly hashed to yield binary vectors. Salient clusters in t...
متن کاملAccurate binning of metagenomic contigs via automated clustering sequences using information of genomic signatures and marker genes.
Metagenomics, the application of shotgun sequencing, facilitates the reconstruction of the genomes of individual species from natural environments. A major challenge in the genome recovery domain is to agglomerate or 'bin' sequences assembled from metagenomic reads into individual groups. Metagenomic binning without consideration of reference sequences enables the comprehensive discovery of new...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Journal of Computational Biology
سال: 2018
ISSN: 1557-8666
DOI: 10.1089/cmb.2017.0250